Exercise 1: biological sequence manipulation
Using object-oriented programming, can you write a series of classes to handle biological sequences and read/write them in FASTA format? Can you add functionality to check that the correct alphabet is being used? Bonus points for writing simple sequence manipulation methods.
You can use ../data/proteome.faa and ../data/genome.fasta as input files.
Hint: a biological sequence is an abstract concept; nucleotide and protein sequences are more specific cases of it.
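One possible starting point, following the hint, is a generic base class with nucleotide and protein subclasses that only differ in their alphabet. This is a minimal sketch rather than a reference solution: the class names, the method names (check_alphabet, to_fasta, reverse_complement, gc_content), the helper functions read_fasta and write_fasta, and the exact alphabets (ACGTN for nucleotides; the 20 standard amino acids plus X and * for proteins) are all assumptions made for illustration.

```python
from io import StringIO


class Sequence:
    """Generic biological sequence: a FASTA header plus a string of residues."""

    # Assumption: an empty alphabet means "accept any character".
    alphabet = frozenset()

    def __init__(self, identifier, residues):
        self.identifier = identifier
        self.residues = residues.upper()
        self.check_alphabet()

    def check_alphabet(self):
        """Raise ValueError if a residue falls outside the class alphabet."""
        if not self.alphabet:
            return
        unexpected = set(self.residues) - self.alphabet
        if unexpected:
            raise ValueError(
                f"{self.identifier}: unexpected characters {sorted(unexpected)}"
            )

    def __len__(self):
        return len(self.residues)

    def to_fasta(self, width=60):
        """Format the record as FASTA text, wrapping residues at `width`."""
        lines = [self.residues[i:i + width]
                 for i in range(0, len(self.residues), width)]
        return f">{self.identifier}\n" + "\n".join(lines) + "\n"


class NucleotideSequence(Sequence):
    # Assumed alphabet: the four DNA bases plus N for unknown.
    alphabet = frozenset("ACGTN")
    _complement = str.maketrans("ACGTN", "TGCAN")

    def reverse_complement(self):
        flipped = self.residues.translate(self._complement)[::-1]
        return NucleotideSequence(self.identifier, flipped)

    def gc_content(self):
        if not self.residues:
            return 0.0
        gc = self.residues.count("G") + self.residues.count("C")
        return gc / len(self.residues)


class ProteinSequence(Sequence):
    # Assumed alphabet: 20 standard amino acids, X (unknown), * (stop).
    alphabet = frozenset("ACDEFGHIKLMNPQRSTVWYX*")


def read_fasta(handle, cls=Sequence):
    """Yield one `cls` instance per FASTA record found in a text handle."""
    identifier, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if identifier is not None:
                yield cls(identifier, "".join(chunks))
            identifier, chunks = line[1:].strip(), []
        else:
            chunks.append(line)
    if identifier is not None:
        yield cls(identifier, "".join(chunks))


def write_fasta(sequences, handle, width=60):
    """Write an iterable of sequences to a text handle in FASTA format."""
    for seq in sequences:
        handle.write(seq.to_fasta(width))


# Small in-memory demo, so the sketch runs without the course data files.
demo = StringIO(">seq1\nACGT\nGGCC\n")
records = list(read_fasta(demo, NucleotideSequence))
print(records[0].reverse_complement().residues)
```

With the course data, the same functions could be used as `with open("../data/genome.fasta") as handle: genome = list(read_fasta(handle, NucleotideSequence))`. Real genome files may contain IUPAC ambiguity codes beyond N, in which case the assumed nucleotide alphabet would need to be widened.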
In [ ]:
Exercise 2: graph theory, manipulation and analysis
Can you write a series of classes/methods to represent and analyse biological networks? You will probably need Node and Edge classes and a Graph container class, but you are free to implement them however you see fit. Methods to compute each node's degree and to extract the graph's connected components should be implemented as well.
You can use the E. coli STRING network as an example; you can download it here. You can decide to parse the whole network, or to keep only high-scoring interactions (i.e. score > 800).
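A compact way to approach this is an undirected graph stored as an adjacency mapping, with connected components found by breadth-first search. The sketch below is one possible layout, not the expected answer: the names Node, Edge, Graph and parse_links are illustrative, and the parser assumes that each line of the network file holds three whitespace-separated fields (two protein identifiers and an integer score) after a header line, which should be checked against the file you actually download.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    name: str


@dataclass(frozen=True)
class Edge:
    source: Node
    target: Node
    score: int = 0


class Graph:
    """Undirected graph stored as a node -> set-of-neighbours mapping."""

    def __init__(self):
        self.adjacency = {}
        self.edges = []

    def add_node(self, node):
        self.adjacency.setdefault(node, set())

    def add_edge(self, edge):
        self.add_node(edge.source)
        self.add_node(edge.target)
        self.adjacency[edge.source].add(edge.target)
        self.adjacency[edge.target].add(edge.source)
        self.edges.append(edge)

    def degree(self, node):
        """Number of distinct neighbours of `node`."""
        return len(self.adjacency[node])

    def connected_components(self):
        """Return a list of node sets, one per connected component (BFS)."""
        seen = set()
        components = []
        for start in self.adjacency:
            if start in seen:
                continue
            component = {start}
            queue = deque([start])
            while queue:
                current = queue.popleft()
                for neighbour in self.adjacency[current]:
                    if neighbour not in component:
                        component.add(neighbour)
                        queue.append(neighbour)
            seen |= component
            components.append(component)
        return components


def parse_links(lines, threshold=None):
    """Build a Graph from lines of 'protein1 protein2 score'.

    Assumed file layout (hypothetical, verify against the real file):
    three whitespace-separated fields per line, with an integer score.
    Lines that do not fit, such as a header, are skipped. If `threshold`
    is given, only interactions with score > threshold are kept.
    """
    graph = Graph()
    for line in lines:
        fields = line.split()
        if len(fields) != 3 or not fields[2].isdigit():
            continue
        score = int(fields[2])
        if threshold is None or score > threshold:
            graph.add_edge(Edge(Node(fields[0]), Node(fields[1]), score))
    return graph


# Toy network, used in place of the real download.
toy = ["protein1 protein2 combined_score",
       "a b 900", "b a 900", "b c 850", "d e 950", "e f 100"]
network = parse_links(toy, threshold=800)
print(network.degree(Node("b")), len(network.connected_components()))
```

Because neighbours are stored in sets, an interaction listed in both directions counts once towards a node's degree, although it appears twice in the edges list. Nodes whose interactions all fall below the threshold never enter the graph, so they are not reported as singleton components.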
In [ ]: